ALZHEIMER'S DISEASE THERAPEUTICS

The field of the invention is Alzheimer's disease 
therapeutics.

Background of the Invention

Alzheimer's disease (AD) is a progressive 
degenerative disorder of the brain that afflicts over 
four million people in the United States. No effective 
treatment is available. The most characteristic change 
observed upon post-mortem histopathological analysis of
AD-afflicted brain tissue is the presence of neuritic and
cerebrovascular plaques containing dense deposits of
amyloid protein (Selkoe, Cell 58:611-612, 1989).
β-Amyloid is a 39-43 amino acid peptide (Glenner and Wong,
Biochem. Biophys. Res. Commun. 120:885-890, 1984; Masters
et al., Proc. Natl. Acad. Sci. USA 82:4245-4249, 1985)
synthesized as part of a larger precursor protein
referred to as amyloid precursor protein (APP), which is
known to have a number of isoforms in humans (APP695, Kang
et al., Nature 325:733-736, 1987; APP751, Ponte et al.,
Nature 331:525-527, 1988, and Tanzi et al., Nature
331:528-530, 1988; and APP770, Kitaguchi et al., Nature
331:530-532, 1988). The amino terminus of β-amyloid is
generated by cleavage of a peptide bond of APP which in
APP695 lies between Met596 and Asp597.

Although structural alterations of APP are 
implicated in the pathogenesis of Alzheimer's disease, it 
remains unknown how they cause the disease. No 
biological function for APP has been identified, although
there is evidence that APP has a receptor-like
architecture (Kang et al., Nature 325:733-736, 1987;
Ponte et al., Nature 331:525-527, 1988; Tanzi et al.,
Nature 331:528-530, 1988; Kitaguchi et al., Nature
331:530-532, 1988), is located on the neuronal surface
(Dyrks et al., EMBO J. 7:949-957, 1988), and possesses an
evolutionarily conserved cytoplasmic domain (Yamada et
al., Biochem. Biophys. Res. Commun. 149:665-671, 1987),




The methods and therapeutic compositions of the
invention are based upon the discovery, described in
detail below, that APP forms a complex with Go, a major
GTP-binding protein (or "G protein") in brain. Like all
G proteins, a molecule of Go is made up of one α subunit
and one βγ subunit. Two isoforms of Go, known as Go1 (or
GoA) and Go2 (or GoB), have been identified; they have
slight amino acid differences in their α subunits, and
are together referred to herein as Go. The cDNA sequence
and deduced amino acid sequence of the α subunits of each
of Go1 and Go2 (as reported by Strathmann et al., Proc.
Natl. Acad. Sci. USA 87:6477-6481, 1990) are shown in
Fig. 4a (SEQ ID NO: 2) and Fig. 4b (SEQ ID NO: 28),



The finding that APP associates with Go is
consistent with related findings concerning other
G proteins, as disclosed in a second application
(USSN ) having the same inventor and filing
date as the present application, which second application
is herein incorporated by reference. The cytoplasmic
APP695 sequence His657-Lys676 (SEQ ID NO: 1) possesses a
specific Go-activating function, and is necessary for
complex formation of this APP with Go; this sequence,
sometimes referred to as the "couplone" region of APP, is
completely conserved in APP751 and APP770, as well as in
mouse APP695. This provides evidence that APP is a
receptor coupled to Go, and suggests that abnormal APP-Go
signalling is involved in the Alzheimer's disease

The invention includes a method of identifying a
therapeutic useful for treating or preventing Alzheimer's 
disease, which method includes the steps of 

contacting (a) a first molecule containing the
couplone portion of APP (SEQ ID NO: 1) with (b) a second
molecule containing the amino acid sequence of Go (SEQ ID
NO: 2) or an APP-associating region of Go (SEQ ID NOs: 3,
4, or 5), in the presence of a candidate compound; and
either (i) determining whether the candidate
compound interferes with (i.e., inhibits partially or
completely) the association of the first and second
molecules, or (ii) determining whether the candidate
compound interferes with the activation of the second
molecule by the first molecule, such interference being
an indication that the candidate compound is a potential
therapeutic useful for treating or preventing Alzheimer's
disease. The determining step may be accomplished by,
for example, immunoprecipitating the first molecule with
an antibody specific for APP, and detecting the presence
or amount of the second molecule which co-precipitates
with the first molecule. Alternatively, the second
molecule can be immunoprecipitated with an antibody
specific for Go, following which the presence or amount
of the first molecule which co-precipitates with the
second molecule is determined. Where activation is the
criterion being measured, the determination step may be
accomplished by contacting the second molecule with a
substrate which is or includes GTP or an analog of GTP
[such as GTPγS or Gpp(NH)p], and detecting or measuring
the binding of the substrate to the second molecule,
wherein such binding is evidence of activation of the
second molecule by the first molecule. In preferred
embodiments, the contacting step is carried out in a
cell-free system; the Mg2+ concentration at which the
contacting step is carried out is between approximately
1 × 10⁻⁷ and 1 × 10⁻² M, and the first molecule includes the
cytoplasmic tail portion of APP695 from residues 649 to
695 (SEQ ID NO: 6) and/or the membrane-spanning portion
of APP695 from residues 639 to 648 (SEQ ID NO: 7) (the
entire membrane-spanning segment of APP695 being from
residues 625 to 648, SEQ ID NO: 8); the first molecule
more preferably includes substantially all of APP (SEQ ID
NO: 9). (Alternatively, the corresponding functional
regions of APP751 or APP770, or any other APP, may be
used.) The second molecule preferably contains two or
three of the putative APP-associating regions referred to
above, and may also contain one or more of the GTP-
binding regions of Go, corresponding to residues 35 to 50
(SEQ ID NO: 10), residues 201 to 218 (SEQ ID NO: 29), or
residues 263 to 274 (SEQ ID NO: 30) of Go1 [Kaziro,
"Structure of the genes coding for the α subunits of G
proteins", Ch. 1 in ADP-ribosylating Toxins and G
Proteins (Moss, J., and Vaughan, M., eds.) pp. 189-206,
American Society for Microbiology, Washington, D.C.
(1988)], and more preferably contains substantially all
of Go (SEQ ID NO: 2).

The invention also includes a system (e.g., a
cell-free in vitro system) for screening candidate
Alzheimer's disease therapeutics, which system includes a
first polypeptide containing a sequence essentially
identical to that of peptide 20 (SEQ ID NO: 1), and a
second polypeptide containing a sequence essentially
identical to one, two or three of the putative APP-
associating regions of Go (SEQ ID NOs: 3, 4, and 5); the
system may also include a means for detecting either (a)
the association of the first polypeptide with the second
polypeptide, or (b) the activation of the second
polypeptide by the first polypeptide. The first
polypeptide may conveniently be anchored to a solid
material (e.g., a cellular membrane, a polystyrene
surface, or a standard matrix material), or may be in a
phospholipid vesicle. It may include a sequence
essentially identical to the membrane-spanning region of
APP, and/or a sequence essentially identical to the
entire cytoplasmic tail of APP. The second molecule
preferably contains the GTP-binding domain of Go, and
more preferably contains the entire sequence of Go.

The invention also features a method for
diminishing the activation of Go in a neuronal cell by
treating the cell with a compound, such as a peptide
fragment of Go or of the cytoplasmic tail of APP, which
blocks association of neuronal Go with, and/or activation
of neuronal Go by, the cytoplasmic tail of APP. The cell
may be so treated in vivo (i.e., in an animal, e.g., a
mammal such as a human or other primate, cow, horse, pig,
sheep, goat, dog, cat, rat, mouse, guinea pig, hamster,
or rabbit) or in vitro. This method may be used to
prevent or treat the symptoms of Alzheimer's disease in a
patient. Such a compound may include, for example, a
peptide having fewer than 50 amino acids (preferably 40
or fewer, and more preferably 30 or fewer), and
containing the sequence of peptide 20. Also within the
invention is a DNA molecule (e.g., a plasmid or viral
DNA) encoding such a peptide, and a therapeutic
composition containing, in a pharmaceutically acceptable
carrier, either the peptide or the DNA molecule.

In another aspect, the invention features a method
for identifying a ligand for which APP is a receptor,
which method includes the steps of

providing an APP molecule, the cytoplasmic tail of
which is accessible to a molecule of Go;

contacting a candidate compound with the
extracellular domain of the APP molecule; and

detecting either (a) association of Go with the
APP molecule, (b) dissociation of Go from the APP
molecule, or (c) activation of Go by the APP molecule,
such association, dissociation, or activation being
evidence that the candidate compound is a ligand of APP.

Other features and advantages of the invention 
will be apparent from the detailed description set forth 
below, and from the claims. 

Brief Description of the Drawings



Fig. 1(a) is a schematic diagram illustrating the
structural organization of APP. The hatched box contains
the sequence of the β/A4 protein; the black box contains
the so-called "Peptide 20" or couplone sequence; filled
circles are N-glycosylation sites. The numbers designate
amino acid sequence numbers corresponding to APP695.

Fig. 1(b) is a bar graph illustrating the effects
of synthetic APP peptides on Go. In (b), (d), (e) and
(f), values represent the mean ± S.E. of three



Fig. 1(c) is a graph illustrating the time course
of the action of peptide 20 on Go. Values represent the
mean of three experiments. Since the S.E. was < 5% of
each value in this figure, the error bars are not
indicated.

Fig. 1(d) is a graph illustrating the effects of
peptide 20 variants on Go.

Fig. 1(e) is a graph illustrating the effect
linkage with a transmembrane region has on the action of
peptide 20 on Go.

Fig. 1(f) is a graph illustrating the effect of
pertussis toxin on peptide 20-induced stimulation of
GTPγS binding to Go.

Figs. 2a-2d are a set of SDS-PAGE gels analyzed by
immunoblotting, which illustrate the immunoprecipitation
of APP and Go by an anti-APP antibody from brain
membranes. (a) Immunoprecipitation of APP by 22C11.
(b) Immunoprecipitation of Go by 22C11. (c) Effect of
Mg2+ on the immunoprecipitation of Go by 22C11.
(d) Effect of peptide 20 on 22C11-induced precipitation
of Go (left) and APP (right). Each of the results
presented in this figure was reproduced at least three
times.

Fig. 3a is a schematic diagram of the construction
method used to prepare recombinant mutant APP cDNAs.
Regions labeled ATG, TAA, TGA signify original
translation and termination sites and a newly inserted
termination site, respectively.

Fig. 3b is a schematic diagram comparing the
structures of authentic APP695 and the two recombinant
mutant APP polypeptides, ΔN and ΔC.

Fig. 3c is an immunoblot analysis of Sf9 membranes
using anti-Alz 90, 1C1, and 4G5.

Fig. 3d is an immunoblot analysis of the 22C11-
precipitate from an Sf9 membrane-Go reconstitution



Fig. 3e is an immunoblot illustrating dissociation
of Go from APP by activation of Go. Each of the results
presented in Figs. 3c-e was reproduced at least three



Fig. 4a is the cDNA sequence and deduced amino
acid sequence of Go1α (Strathmann et al., Proc. Natl.
Acad. Sci. USA 87:6477-6481, 1990) (SEQ ID NO: 2).

Fig. 4b is the cDNA sequence and deduced amino
acid sequence of Go2α (Strathmann et al.) (SEQ ID NO: 28).

Detailed Description 
It was previously shown that the insulin-like
growth factor II receptor (IGF-IIR) couples directly to
the G protein referred to as Gi (Nishimoto et al., J.
Biol. Chem. 264:14029-14038, 1989) via a 14-residue
section of the cytoplasmic tail of IGF-IIR, Arg2410-Lys2423
(Okamoto et al., Cell 62:709-717, 1990; Okamoto et al.,
Proc. Natl. Acad. Sci. U.S.A. 88:8020-8023, 1991). The
structural determinants for the Gi-activating function in
IGF-IIR were defined as (i) two basic residues at the N-
terminal region of the amino acid sequence, and (ii) a C-
terminal motif of B-B-X-B or B-B-X-X-B (where B is a
basic residue and X is a non-basic residue) (Okamoto et
al., Cell 62:709-717, 1990). To assess whether APP might
function as a G protein-coupled receptor, the amino acid
sequence of human APP695 was examined for regions of less
than 26 residues which satisfy (i) and (ii). The
sequence His657-Lys676 is the only such region in the
cytoplasmic domain of APP695. In two other isoforms of
APP, APP751 (Ponte et al., Nature 331:525-527, 1988; Tanzi
et al., Nature 331:528-530, 1988) and APP770 (Kitaguchi et
al., Nature 331:530-532, 1988), as well as in mouse APP695
(Yamada et al., Biochem. Biophys. Res. Commun. 149:665-
671, 1987), this sequence is completely conserved.
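
Criteria (i) and (ii) can be illustrated with a minimal sketch. It rests on several assumptions that are not stated in the text: that "basic residue" means Lys, Arg or His; that criterion (i) means at least two basic residues within a short N-terminal window (the window length of 6 is an arbitrary, hypothetical choice); and that criterion (ii) means the region ends in B-B-X-B or B-B-X-X-B. The function and constant names are illustrative only.

```python
import re

BASIC = "KRH"  # assumption: Lys, Arg and His all count as basic residues

# Criterion (ii): the region ends in B-B-X-B or B-B-X-X-B.
C_TERMINAL_MOTIF = re.compile(rf"[{BASIC}]{{2}}[^{BASIC}]{{1,2}}[{BASIC}]$")


def satisfies_criteria(region: str, n_window: int = 6) -> bool:
    """Return True if a candidate region meets criteria (i) and (ii).

    n_window is a hypothetical parameter: the number of N-terminal
    residues inspected for the two basic residues of criterion (i).
    """
    n_terminal_basics = sum(1 for residue in region[:n_window] if residue in BASIC)
    has_c_terminal_motif = C_TERMINAL_MOTIF.search(region) is not None
    return n_terminal_basics >= 2 and has_c_terminal_motif


peptide_20 = "HHGVVEVDAAVTPEERHLSK"  # His657-Lys676 of APP695
print(satisfies_criteria(peptide_20))
```

Under these assumptions the His657-Lys676 region passes both tests, while a region lacking either the N-terminal histidines or the C-terminal RHLSK segment does not.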



Preparation of peptides 

A peptide corresponding to the His657-Lys676 region
of APP [HHGVVEVDAAVTPEERHLSK (SEQ ID NO: 1)] was
synthesized and purified by standard methods using solid
phase synthesis; this peptide is referred to as
"peptide 20". Similarly prepared were peptides
corresponding to other regions of APP695:

APP(1-10), MLPGLALLLL (SEQ ID NO: 11);
APP(597-606), DAEFRHDSGY (SEQ ID NO: 12);
APP(677-695), MQQNGYENPTYKFFEQMQN (SEQ ID NO: 13);
and APP(639-648), TVIVITLVML (SEQ ID NO: 7), a portion of
the transmembrane region of APP;

as well as the following variants of peptide 20:

HGVVEVDAAVTPEERHLSK (H-deleted, SEQ ID NO: 14);
GVVEVDAAVTPEERHLSK (HH-deleted, SEQ ID NO: 15);
HHGVVEVDAAVTPEE (RHLSK-deleted, SEQ ID NO: 16);
KQYTSIHHGVVEVDAAVTPEERHLSK (KQYTSI-added, SEQ ID NO: 17);
and TVIVITLVMLHHGVVEVDAAVTPEERHLSK (transmembrane region-
connected peptide 20; SEQ ID NO: 18).
Peptides were purified by HPLC to greater than 95%
purity, and were used immediately after synthesis.
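
As a consistency check on the peptides listed above, the following sketch rebuilds each peptide 20 variant from the parent sequence and confirms that each range-named peptide has the number of residues its range implies. The helper names are hypothetical, and the sequences are those of SEQ ID NOs: 1, 7, 11-13 as read from this text.

```python
PEPTIDE_20 = "HHGVVEVDAAVTPEERHLSK"  # APP695 His657-Lys676 (SEQ ID NO: 1)

# Peptides named by residue range: (first residue, last residue, sequence).
RANGE_PEPTIDES = {
    "APP(1-10)": (1, 10, "MLPGLALLLL"),
    "APP(597-606)": (597, 606, "DAEFRHDSGY"),
    "APP(639-648)": (639, 648, "TVIVITLVML"),
    "APP(657-676)": (657, 676, PEPTIDE_20),
    "APP(677-695)": (677, 695, "MQQNGYENPTYKFFEQMQN"),
}


def length_matches_range(first: int, last: int, sequence: str) -> bool:
    """A peptide spanning residues first..last has last - first + 1 residues."""
    return len(sequence) == last - first + 1


def build_variants(parent: str) -> dict:
    """Rebuild the listed variants of the parent peptide by deletion or addition."""
    return {
        "H-deleted": parent[1:],
        "HH-deleted": parent[2:],
        "RHLSK-deleted": parent[:-5],
        "KQYTSI-added": "KQYTSI" + parent,
        "transmembrane region-connected": "TVIVITLVML" + parent,
    }


for name, (first, last, seq) in RANGE_PEPTIDES.items():
    print(name, len(seq), length_matches_range(first, last, seq))

for name, seq in build_variants(PEPTIDE_20).items():
    print(name, seq)
```

Deriving the variants from one parent string, rather than typing each one out, makes a transcription error in any single variant easy to spot.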


Materials and Methods. 

Trimeric Go was purified to homogeneity from
bovine brain as described (Katada et al., FEBS Lett.
213:353-358, 1987). This Go preparation was stored in 20
mM Hepes/NaOH (pH 7.4), 1 mM EDTA, and 0.7% CHAPS, and
diluted ≥ 10-fold for assays. Gi, which was used in
combination with 1.5-fold concentrated Gβγ (Okamoto et
al., Proc. Natl. Acad. Sci. U.S.A. 88:8020-8023, 1991), was
prepared as described by Morishita et al., Biochim.
Biophys. Acta 161:1280-1285, 1989. Low molecular weight
G proteins were prepared as described by Matsui et al.,
J. Biol. Chem. 263:11071-4, 1988; Gβγ was purified from
bovine brain as set forth in Katada et al., FEBS Lett.
213:353-358, 1987.

GTPγS binding to Go was assayed in a buffer
containing 50 mM Hepes/NaOH (pH 7.4), 100 μM EDTA, 120 μM
MgCl2, and 60 nM [35S]GTPγS (DuPont-New England Nuclear)
at 37°C, and the fraction of total Go bound to GTPγS was
measured as described (Okamoto et al., Cell 62:709-717,
1990). GTPγS binding to peptides was negligible. The
total amount of Go in a given preparation was defined as
the saturation amount of GTPγS bound to Go following a
30-min incubation of Go with 10 mM Mg2+ and > 60 nM GTPγS
at 30°C.

Reconstitution of Go into phospholipid vesicles
was accomplished with 1 mg/ml of phosphatidylcholine,
using the gel filtration method (Nishimoto et al., J.
Biol. Chem. 264:14029-14038, 1989). In a final
incubation for GTPγS binding, 5 nM of reconstituted Go
was used.

For experiments exploring the effect of Mg2+, the
Mg2+ concentration was set by using Mg-EDTA buffer
(Birnbaumer et al., J. Eur. J. Biochem. 136:107-112,
1983).

Bovine brain membranes, prepared as described
(Katada et al., FEBS Lett. 213:353-358, 1987) and
suspended in buffer A [10 mM Hepes/NaOH (pH 7.4), 1 mM
EDTA, 10 mM acetic acid, and 250 mM sucrose, plus a
mixture (termed "PAL") of 2 mM PMSF, 20 μg/ml aprotinin,
and 20 μM leupeptin], were centrifuged and the pellet was
solubilized for 1 h at 4°C in buffer B [10 mM Hepes/NaOH
(pH 7.4), 1 mM EDTA, 120 mM NaCl, 0.5% CHAPS, and PAL].
Following centrifugation of the material at 15000 rpm for
1 h, the supernatant (500 μg protein, unless specified)
was incubated in buffer C [20 mM Hepes/NaOH (pH 7.4), 1
mM EDTA, 120 mM NaCl, and PAL] and 2% BSA with 22C11-
coated protein G-Sepharose, which had been prepared by
incubating protein G-Sepharose (Pharmacia) with anti-APP
monoclonal antibody 22C11 (Boehringer Mannheim) for 1 h
at 4°C. An antibody concentration of ≥ 2 μg/ml was found
to saturate precipitation of APP and Go, so 2 μg/ml was
the concentration used for immunoprecipitation studies.

As a control, 2 μg/ml of rabbit IgG was used. After
overnight shaking at 4°C, the immunoprecipitated sample
was centrifuged at 5000 rpm for 5 min. The pellet was
washed three times with ice-cold buffer C and the final
pellet was applied to SDS-PAGE. Electroblotting onto a
PVDF sheet was performed as described (Okamoto et al., J.
Biol. Chem. 266:1085-1091, 1991). After blocking with
PBS containing 2% skim milk and 1% BSA, the sheet was
incubated with the first antibody [1 μg/ml of 22C11;
1/1000 dilution of anti-Goα monoclonal antibody GC/2
(DuPont-New England Nuclear); 1/1000 dilution of 1C1, a
monoclonal antibody against the C-terminal peptide 677-695
of APP695] for 4 h, and then exposed to horseradish
peroxidase-conjugated goat IgG reactive for mouse or
rabbit immunoglobulins for 2-4 h at room temperature.
The antigenic bands were detected with an ECL detection
kit (Amersham). YL1/2 (SERA Lab), an anti-tubulin
antibody, was used at 1:500 dilution for immunodetection.

Effects of synthetic APP peptides on G proteins. 

In the experiment shown in Fig. 1(b), 10 nM Go was
incubated with water or 100 μM of each peptide for 2 min,
and the amount of GTPγS bound to Go at the end of this
period was measured. In the experiment shown in Fig.
1(c), 10 nM Go was incubated with water (○) or 100 μM
peptide 20 (SEQ ID NO: 1) (●), and GTPγS binding was
measured at the indicated times. From Fig. 1(d), it can
be seen that peptide 20 (SEQ ID NO: 1) stimulated the
rate constant of GTPγS binding to Go in a dose-dependent
manner, whereas Fig. 1(b) shows that peptides from other
regions of APP695 were ineffective. GTPγS binding to Go
in the presence or absence of peptide 20 (SEQ ID NO: 1)
obeyed first-order kinetics according to the equation

$\ln[(B_T - B)/B_T] = -k_{app}\,t$

(B is the binding at time t; B_T is the total binding
observable at infinite time; and k_app is the rate constant
for GTPγS binding). The ability of peptide 20 (SEQ ID
NO: 1) to activate Go was gradually decreased during
storage at either -4°C or -20°C.
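
The first-order relation above implies that ln[(B_T - B)/B_T] plotted against t is a straight line through the origin with slope -k_app. A minimal sketch of estimating k_app in that way follows; the function names and the synthetic data (a hypothetical k_app of 0.25 per minute, with B_T normalised to 1) are illustrative assumptions and are not measurements from these experiments.

```python
import math


def fraction_bound(k_app: float, t: float) -> float:
    """B/B_T at time t for first-order binding: 1 - exp(-k_app * t)."""
    return 1.0 - math.exp(-k_app * t)


def estimate_k_app(times, bound, total):
    """Least-squares slope through the origin of ln[(B_T - B)/B_T] versus t.

    Returns k_app (the negated slope). Points with B >= B_T are skipped
    because the logarithm is undefined there.
    """
    num = 0.0
    den = 0.0
    for t, b in zip(times, bound):
        if b >= total:
            continue
        y = math.log((total - b) / total)
        num += t * y
        den += t * t
    if den == 0.0:
        raise ValueError("need at least one time point with t > 0 and B < B_T")
    return -num / den


# Synthetic example only: hypothetical rate constant, B_T normalised to 1.
times = [1.0, 2.0, 5.0, 10.0]
bound = [fraction_bound(0.25, t) for t in times]
print(round(estimate_k_app(times, bound, total=1.0), 6))
```

Fitting through the origin reflects the form of the equation, which has no intercept term; a fit with a free intercept would be a different (and here unnecessary) model.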

Studies using structural variant peptides suggest
that both the N-terminal basic residues and the C-
terminal B-B-X-X-B motif play essential roles in the Go-
activating function of peptide 20 (SEQ ID NO: 1) [Fig.
1(d)]. In this experiment, 10 nM Go was incubated with
various concentrations of HHGVVEVDAAVTPEERHLSK (peptide
20, SEQ ID NO: 1; 0), HGVVEVDAAVTPEERHLSK (H-deleted, SEQ
ID NO: 14; 0), GVVEVDAAVTPEERHLSK (HH-deleted, SEQ ID
NO: 15; □), HHGVVEVDAAVTPEE (RHLSK-deleted, SEQ ID
NO: 16; ♦), or KQYTSIHHGVVEVDAAVTPEERHLSK (KQYTSI-added,
SEQ ID NO: 17; ■), and GTPγS binding to Go at 2 min was
measured. Fig. 1(d) indicates which aspects of primary
structure determine the Go-activator function of peptide
20 (SEQ ID NO: 1). Deletion of either one or both of the
N-terminal His residues nullified the Go-activator function
of the peptide. The peptide (SEQ ID NO: 16) in which the
C-terminal five residues of peptide 20 (SEQ ID NO: 1) have
been deleted is several times less potent than peptide 20
(SEQ ID NO: 1).

As illustrated in Fig. 1(e), Go reconstituted in
phospholipid vesicles was incubated with transmembrane
region-connected peptide 20
(TVIVITLVMLHHGVVEVDAAVTPEERHLSK, SEQ ID NO: 18; □) or the
partial sequence of the APP transmembrane domain alone
(TVIVITLVML, SEQ ID NO: 7; □). Transmembrane region-
connected peptide 20 (SEQ ID NO: 18) was also incubated
with Go in the absence of phospholipids and the presence
of 0.07% CHAPS (♦). The transmembrane region-connected
peptide 20 (SEQ ID NO: 18) stimulated Go reconstituted in
phospholipid vesicles with a potency 10 times greater
than that of peptide 20 (SEQ ID NO: 1). The
transmembrane region alone (SEQ ID NO: 7) was without
effect on Go. In the absence of phospholipids,
transmembrane region-connected peptide 20 (SEQ ID NO: 18)
showed an effect on Go no more potent than peptide 20
(SEQ ID NO: 1). Therefore, the stimulatory action of
this transmembrane region-connected peptide (SEQ ID
NO: 18) is attributed to the peptide 20 (SEQ ID NO: 1)
sequence; the potentiating effect of the transmembrane
region may be exerted by interactions with phospholipids.
In the experiment shown in Fig. 1(f), ADP-
ribosylation of Go was accomplished by incubating Go
reconstituted in phospholipid vesicles with 10 μg/ml
preactivated pertussis toxin in the presence of 10 μM NAD
for 15 min at 30°C as described (Okamoto et al., Cell
62:709-717, 1990). Preactivation of pertussis toxin
(Funakoshi, Japan) was carried out by treating the toxin
with 100 mM ATP and 1 mM DTT for 10 min at 30°C.
Reconstitution of Go into phospholipid vesicles was
accomplished with 1 mg/ml phosphatidylcholine (Sigma,
P-5638) at a final Go concentration of 50.2 nM in a buffer
containing 20 mM Hepes/NaOH (pH 7.4), 0.1 mM EDTA, 1 mM
DTT, and 100 mM NaCl by the gel filtration method
(Nishimoto et al., J. Biol. Chem. 264:14029-14038, 1989).
In a final incubation for GTPγS binding, 5 nM of
reconstituted Go was used. Increasing concentrations of
peptide 20 (SEQ ID NO: 1) were incubated for 2 min with
Go reconstituted in phospholipid vesicles which had been
treated with pertussis toxin in the presence (♦) or
absence (□) of NAD, and GTPγS binding to Go was measured.

Although peptide 20 (SEQ ID NO: 1) produced 2- to 3-
fold stimulation of GTPγS binding to Go in the mid-range
of Mg2+ concentrations, the effect of peptide 20 (SEQ ID
NO: 1) could not be observed at low (≤ 100 nM) or high (>
10 mM) Mg2+ concentrations.

Peptide 20 (SEQ ID NO: 1) had little effect on G
proteins other than Go: Gi1, Gi2, Gi3, Gs, c-Ki-ras p21
and smg p25A were not stimulated by this peptide (data
not shown). Thus, peptide 20 (SEQ ID NO: 1) activates Go
in a receptor-like manner, suggesting that APP interacts
directly with Go through the peptide 20 (SEQ ID NO: 1)
region.

Coprecipitation of APP and Go

In an effort to determine whether APP is linked to
Go in a native membrane environment, the coprecipitation
studies shown in Fig. 2a were performed. Solubilized
membranes of bovine brain were first immunoprecipitated
by monoclonal anti-APP antibody 22C11, and the
immunoprecipitate was then probed by immunodetection with
22C11 (Lane 2) or 1C1, a monoclonal antibody against the
C-terminal peptide 677-695 of APP (SEQ ID NO: 13; Lane 4).
Lanes 1 and 3 of Fig. 2a indicate the controls in which
either no solubilized membranes were included (Lane 1),
or rabbit IgG was used for the precipitation step instead
of antibody 22C11 (Lane 3). In each control,
immunodetection was performed with 22C11. The 55-kDa and
25-kDa bands seen in Lanes 1 and 2 may be heavy and light
chains of the 22C11 used for precipitation, which reacted
with an anti-mouse IgG antibody during immunodetection.
The precipitate by control rabbit IgG contained no
detectable APP. Although the 100 kDa molecular size of
APP appears here to be slightly less than the 110-130 kDa
reported (Weidemann et al., Cell 57:115-126, 1989), the
precipitated form is unlikely to be an extracellular
fragment of APP, because 1C1 recognizes this 100-kDa
band.

In the experiment illustrated in Fig. 2b,
coprecipitation of various G proteins with APP was
investigated. Bovine brain membrane preparations were
immunoprecipitated with 22C11; the immunoprecipitated
proteins were subjected to SDS-PAGE and immunoblotted
with the indicated anti-G protein antisera (1/1000
dilution). Lane 2: GC/2, anti-Goα antiserum; lane 3:
GC/2 plus 1 μg/ml of purified Go; lane 4: GA/1, common Gα
antiserum; lane 5: AS/7, anti-Giα antiserum; lane 6:
MS/1, common Gβ antiserum. Lane 1 shows a control
immunoblot with GC/2, in which a buffer solution rather
than the bovine brain membrane preparation was
immunoprecipitated with 22C11. Lane 7 indicates
immunoblotting with GC/2 of the precipitate resulting
from immunoprecipitation of brain membranes with control
rabbit IgG, rather than 22C11. The identity of the 39-
kDa protein in lane 2 as Go was verified by its absence
in the non-membrane control (lane 1); by its staining
with another Goα-specific antibody, αGO1 (Morishita et
al., Eur. J. Biochem. 174:87-94, 1988) (data not shown);
and by a diminution of staining of this band in the
presence of excess soluble Go (lane 3). The 22C11-
precipitate also contained immunoreactivity of Gβ in a
doublet at 35-36 kDa (lane 6). The 22C11-precipitate did
not react with an anti-Giα antibody AS/7 (lane 5). The
antibody GA/1 detected only a 39-kDa band in the 22C11-
precipitate (lane 4). The control rabbit IgG
immunoprecipitate did not produce anti-Go-immunoreactive
bands corresponding to either APP or Go (lane 7). These
experiments indicate that the 22C11-precipitate from
brain membranes contains APP immunoreactivity at 100 kDa,
Goα immunoreactivity at 39 kDa, and Gβ immunoreactivity
in a doublet at 35-36 kDa, but no detectable
immunoreactivity indicating the presence of Giα or other
heterotrimeric G proteins. A tubulin antibody, YL1/2,
did not stain the 22C11-precipitate (data not shown).

In the experiment shown in Fig. 2c, the effect
of Mg2+ concentration on co-precipitation of Go with anti-
APP antibody was studied. 100 μg of solubilized brain
membranes were precipitated by 22C11 in the presence of
various Mg2+ concentrations controlled with Mg-EDTA buffer
(Birnbaumer et al., J. Eur. J. Biochem. 136:107-112,
1983). The precipitates were analyzed by immunoblotting
with GC/2. The control lane indicates the results of
precipitation of brain membranes by rabbit IgG followed
by immunodetection with GC/2. In the absence of Mg2+, Go
was less efficiently co-precipitated by 22C11. Mg2+
concentrations between 1 μM and 1 mM resulted in maximal
immunoprecipitation of Go. At concentrations > 10 mM,
relatively little Go was precipitated. In contrast,
immunoprecipitation of APP by 22C11 was not affected by
Mg2+ concentration (data not shown). These results
indicate that, while Mg2+ is not absolutely required for
complex formation by APP and Go, the concentration of Mg2+
does strongly influence complex formation. A mid range
of Mg2+ concentration was found to facilitate APP-Go
association.

Fig. 2d illustrates the results of an experiment
indicating that peptide 20 (SEQ ID NO: 1) prevents the
22C11-mediated co-precipitation of Go, whereas it does not
affect the precipitation of APP by 22C11. In contrast, a
control peptide (SEQ ID NO: 13) representing a segment of
APP different from that represented by peptide 20 (SEQ ID
NO: 1) had no discernible effect on 22C11-mediated co-
precipitation of Go. In this experiment, solubilized
brain membranes were incubated with 22C11-coated beads in
the presence of 10 μM peptide 20 (SEQ ID NO: 1; 2nd and
5th lanes) or 10 μM of the control peptide, peptide 677-695
of APP (SEQ ID NO: 13; 3rd and 6th lanes), or in the
absence of both of these peptides (1st and 4th lanes).
In this experiment, an anti-mouse IgG antibody different
from that used in (a) was employed.

Precipitation of Go reconstituted with recombinant APP-
antibody complex

A baculovirus DNA encoding full-length APP695 (SEQ
ID NO: 9) was prepared as outlined in Fig. 3a. Authentic
mouse APP695 cDNA (SEQ ID NO: 9) was provided by Dr.
Yoshiyuki Sakaki (University of Tokyo, Japan) (Yamada et
al., Biochem. Biophys. Res. Commun. 149:665-671, 1987) in
the vector pUC18. The HindIII-BamHI fragment containing
the entire coding region was initially subcloned into the
vector pBR322 (pBR-APP). A single BamHI site was
inserted immediately before the ATG codon of the HindIII-
SphI fragment. This BamHI site was inserted to permit
efficient expression of the encoded APP protein in
baculovirus-infected cells. The BamHI site-inserted
APP695-coding DNA (BamHI-APP695) was constructed from the
HindIII-SphI fragment and pBR-APP, utilizing their
internal KpnI sites, and subcloned into pUC18. By using
BamHI-APP695 as template, two truncation mutants were
generated and subcloned into pUC18. These mutants
possess an insertion of two TGA codons immediately before
(ΔN) or after (ΔC) the peptide 20 sequence. Each BamHI-
BamHI fragment of these respective APP-variation-encoding
pUC18 plasmids was inserted into the baculovirus
transfer/expression vector pVL1393 (Invitrogen). The
entire region that had been through a single-stranded
intermediate was sequenced to confirm the absence of
unwanted nucleotide changes. New insertions were
generated by oligonucleotide-directed mutagenesis with a
kit (Takara) by the method of Kunkel et al. (Meth.
Enzymol. 154:367-382, 1987). For the insertion of a
BamHI site, a restriction fragment encoding the ATG start
codon was subcloned into the vector M13mp18 and a single
stranded template was generated. An oligonucleotide
primer (CCACGCAGGATCACGGGATCCATGCTGCCCAGCTTG; SEQ ID
NO: 19) was used to introduce GGATCC (SEQ ID NO: 20)
immediately before the start codon. Following primer
extension, the phage was used to transform E. coli strain
JM109. Plaques were selected and single stranded DNA was
sequenced. A restriction fragment containing the mutated
region was subcloned into pBR-APP. For the insertion of
the stop codons, oligonucleotide primers 

[ CAGTACACAT CCATCTGATGACATCATGG CGTGGTG (SEQ ID NO: 21) and 

* 

30 CGCCATCTCTCCAGTGATGAATGCAGCAGAACGGA (SEQ ID NO: 22)] and 

the Mi3mpl9 vector were used to introduce two sequential 
TGA stop codons. Using the method of Summers and Smith 
(Summers et al., Tex. Agric. Exp. Stn. Bull. 1555, 1987), 
baculoviruses incorporating these APP cDNAs were 
35 generated using selection by immunoblot analysis with 
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22C1X, and recovered by Infecting Sf 9 cells (invitrogen) . 
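
As a quick consistency check on the mutagenic primers quoted above, the following Python sketch confirms that the BamHI primer places GGATCC directly in front of an ATG and that both stop-codon primers carry the TGATGA double stop. The helper names are illustrative and are not part of the original work. The in-frame translation of SEQ ID NO: 21 assumes the primer is read from its first base, an assumption that is not stated in the text.

```python
from itertools import product

# Primer sequences as quoted in the text (SEQ ID NOs: 19, 21 and 22).
PRIMER_SEQ19 = "CCACGCAGGATCACGGGATCCATGCTGCCCAGCTTG"
PRIMER_SEQ21 = "CAGTACACATCCATCTGATGACATCATGGCGTGGTG"
PRIMER_SEQ22 = "CGCCATCTCTCCAGTGATGAATGCAGCAGAACGGA"

BAMHI_SITE = "GGATCC"   # SEQ ID NO: 20
DOUBLE_STOP = "TGATGA"  # two sequential TGA codons

# Standard genetic code, built in the conventional TCAG codon order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def site_precedes_start(primer: str, site: str = BAMHI_SITE) -> bool:
    """Return True if `site` is immediately followed by an ATG in `primer`."""
    return (site + "ATG") in primer


def translate(dna: str) -> str:
    """Translate `dna` from its first base; a trailing partial codon is dropped."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    print(site_precedes_start(PRIMER_SEQ19))
    print(DOUBLE_STOP in PRIMER_SEQ21, DOUBLE_STOP in PRIMER_SEQ22)
    print(translate(PRIMER_SEQ21))
```

Read in that frame, SEQ ID NO: 21 translates to Gln-Tyr-Thr-Ser-Ile, two stops, then His-His-Gly-Val-Val, which is consistent with a stop placed directly before the peptide 20 sequence.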

Four days after treatment of Sf9 cells with the viruses,
cells were homogenized and suspended in buffer A. After
the solubilization of the pellet with buffer B, the
supernatant (100 µg) was mixed overnight with
22C11-coated protein G-Sepharose in buffer C plus 2% BSA
at 4°C on a shaker. After centrifugation, the precipitated
beads were incubated with purified Go (1 µg) in buffer C
supplemented with 1.1 mM MgCl2 and 2% BSA for 8-24 h at
4°C on a shaker. After washing four times with ice-cold
buffer C, the centrifugation precipitate was subjected to
SDS-PAGE, electroblotting, and immunodetection with the
first antibodies (1 µg/ml of 22C11; 10 µg/ml of Anti-Alz
90; 1/1000 dilution of 1C1; 1/500 dilution of 4G5; 0.1
µg/ml of αGO1) and the second goat anti-mouse or
anti-rabbit IgGs conjugated with HRP. (Immunodetection of
1C1 and 4G5, both of which are mouse IgM (κ), was
accomplished using as second antibody a mixture of
HRP-conjugated anti-rabbit IgG, rabbit anti-mouse IgM and
rabbit anti-mouse κ antibodies.)

The three APP constructs prepared as described above are
compared in the schematic diagram of Fig. 3b. The
polypeptides encoded by all three constructs retain the
entire transmembrane and extracellular domains of APP;
while ΔN (SEQ ID NO: 23) lacks all of the peptide 20
residues as well as the sequence on the carboxy terminal
side of the peptide 20 region, ΔC (SEQ ID NO: 24) retains
the peptide 20 sequence and is missing only the latter
sequence.

Sf9 cells were infected, using standard methods, by
recombinant baculoviruses encoding full length APP695
cDNA (SEQ ID NO: 9), APP1-656 cDNA (ΔN; SEQ ID NO: 23), or
APP1-676 cDNA (ΔC; SEQ ID NO: 24). In uninfected Sf9
cells, no immunoreactivity for anti-APP or anti-Go
antibodies was detected (data not shown). The membranes
of Sf9 cells infected with the baculoviruses encoding
APP695 (SEQ ID NO: 9), ΔN (SEQ ID NO: 23), and ΔC (SEQ ID
NO: 24) genes (referred to as Sf9-APP695, Sf9-ΔN, and
Sf9-ΔC, respectively) were found to express, respectively,
130-, 120- and 130-kDa proteins reactive with antibody
22C11 (Fig. 3d, right side). The Sf9-APP695 cells
expressed APP at ≈0.1% of the total membrane protein.
When the membranes of the three types of infected cells
were immunoprecipitated with antibody Anti-Alz 90
(Boehringer Mannheim), a mouse monoclonal antibody
specific for an epitope corresponding to residues 551-608
of APP (SEQ ID NO: 25; a section of APP that is within the
extracellular domain), 130-kDa, 120-kDa, and 130-kDa
proteins were recognized in Sf9-APP695, Sf9-ΔN, and Sf9-ΔC
cells, respectively (Fig. 3c, top panel). Membranes from
all three types of infected cells showed approximately
equivalent reactivity to the antibody, indicating that at
least this portion of the extracellular domain was intact
in each of the three proteins and that all three cell types
express approximately equal amounts of recombinant
protein. When the antibody used was 1C1, a mouse
monoclonal prepared against a peptide corresponding to
residues 677-695 of APP (SEQ ID NO: 13), only Sf9-APP695
membranes were reactive, indicating that the region
corresponding to the C-terminal portion of the cytoplasmic
domain is missing from both ΔN (SEQ ID NO: 23) and ΔC (SEQ
ID NO: 24) (Fig. 3c, middle panel). When the antibody used
was 4G5, a mouse monoclonal antibody raised against a
peptide corresponding to residues 657-676 of APP (SEQ ID
NO: 1; the peptide 20 region of the cytoplasmic domain),
130-kDa bands from both Sf9-APP695 and Sf9-ΔC membranes
reacted with the antibody, but Sf9-ΔN membranes did not, a
demonstration that ΔN (SEQ ID NO: 23) but not ΔC (SEQ ID
NO: 24) lacks the peptide 20 region of APP (Fig. 3c,
bottom panel). These experiments clearly indicate that the
expressed proteins are recombinant APP695 (SEQ ID NO: 9),
APP1-656 (SEQ ID NO: 23), and APP1-676 (SEQ ID NO: 24),
respectively, as designed.
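
The antibody mapping above follows from simple interval logic, sketched below in Python. The residue spans are taken from the text. The rule that an antibody reacts only when its whole epitope span is present is a simplifying assumption made for illustration, and the function and dictionary names are hypothetical.

```python
# Residue spans (1-based, inclusive) as stated in the text.
CONSTRUCT_SPANS = {
    "APP695": (1, 695),    # full length
    "APP1-656": (1, 656),  # stop codons inserted before the peptide 20 region
    "APP1-676": (1, 676),  # stop codons inserted after the peptide 20 region
}
EPITOPE_SPANS = {
    "Anti-Alz 90": (551, 608),  # extracellular domain
    "4G5": (657, 676),          # peptide 20 region
    "1C1": (677, 695),          # C-terminal end of the cytoplasmic domain
}


def contains(span, epitope):
    """Assumption: reactivity requires the entire epitope span to be present."""
    return span[0] <= epitope[0] and epitope[1] <= span[1]


def predicted_reactivity():
    """Map each construct to the antibodies expected to recognize it."""
    return {
        construct: {ab: contains(span, ep) for ab, ep in EPITOPE_SPANS.items()}
        for construct, span in CONSTRUCT_SPANS.items()
    }


if __name__ == "__main__":
    for construct, row in predicted_reactivity().items():
        print(construct, row)
```

Under that assumption the predicted pattern matches the three panels described for Fig. 3c: all constructs react with Anti-Alz 90, only the full-length protein reacts with 1C1, and only the 1-656 construct fails to react with 4G5.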
The 22C11-precipitates from these Sf9 membranes
expressing various forms of APP were exposed to purified
Go, reprecipitated with 22C11, and subjected to
immunoblot analysis using anti-Goα antibody αGO1
(Fig. 3d, left four lanes) and by 22C11 (right four
lanes). αGO1 (Morishita et al., Eur. J. Biochem.
174:87-94, 1988) was provided by Dr. Tomiko Asano;
similar results were obtained when antibody GC/2 was
substituted. The control lanes are 22C11-precipitate
exposed to Go in the absence of Sf9 membranes.
Approximately 1/10-1/20 (0.05-0.1 µg/tube) of the
reconstituted Go was precipitated, together with a
comparable amount (≈0.1 µg/tube) of APP. Easily
detectable amounts of Goα were present in the final
precipitate when Go was mixed with 22C11-precipitates
from Sf9-ΔC or Sf9-APP695 membranes, but essentially no
Goα was found in the final precipitate from Sf9-ΔN
membranes. Thus, formation of an APP-Go complex requires
the peptide 20 region, residues 657-676 (SEQ ID NO: 1).
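
The recovery figures quoted above can be reproduced with a line of arithmetic, shown here as a small Python sketch. It assumes the Go input is the 1 microgram per tube given in the methods; the function name is illustrative only.

```python
def recovered_mass_ug(input_ug: float, fraction: float) -> float:
    """Mass recovered in the precipitate, given the input mass and the fraction recovered."""
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must lie between 0 and 1")
    return input_ug * fraction


GO_INPUT_UG = 1.0  # purified Go added per tube, as stated in the methods

if __name__ == "__main__":
    low = recovered_mass_ug(GO_INPUT_UG, 1 / 20)
    high = recovered_mass_ug(GO_INPUT_UG, 1 / 10)
    print(f"{low:.2f}-{high:.2f} ug of Go per tube")
```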

In the experiment illustrated in Fig. 3e,
22C11-precipitates from Sf9-APP695 membranes (100 µg
protein each) were incubated with activated Go (lanes 2
and 4) or unactivated Go (lanes 1 and 3); the final
precipitates (left panel) and supernatants (right panel)
were analyzed by simultaneous immunoblotting with 22C11
and αGO1 antibodies. Activation of Go was carried out by
incubating Go in 20 mM Hepes/NaOH (pH 7.4), 1 mM EDTA, 2
mM MgCl2, and 1 µM GTPγS overnight at room temperature.
When Go was incubated with GTPγS, no Goα associated with
the APP-22C11 complex (Fig. 3e), suggesting that the
activation state of the G protein regulates APP-Go
association.

This study suggests that APP functions as a
receptor coupled to Go through the Go-activator
cytoplasmic domain His657-Lys676 (SEQ ID NO: 1). APP has
a point mutation in at least one form of familial
Alzheimer's disease (Goate et al., Nature 349:704-706,
1991). A structural alteration of APP is therefore
thought to be one cause of Alzheimer's disease, although
it remains unknown how the mutation might produce the
disease. One novel possibility suggested by this study
is that the cytoplasmic, C-terminal fragment of APP is
pathogenic. It has been suggested (Abraham et al.,
Biotechnology 7:147-153, 1989; Shivers et al., EMBO J.
7:1365-1370, 1988; Kametani et al., Biomedical Research
10:179-183, 1989) that the residual C-terminal portion of
APP may remain in the cell membrane after abnormal
cleavage of APP to produce β/A4 protein in Alzheimer's
disease neurons. By analogy with the oncogenic
transformation of c-erb B into v-erb B, such a structural
alteration of APP may alter its function and prompt APP
to constitutively activate Go. This hypothesis is
consistent with the study (Yankner et al., Science
245:417-420, 1989) indicating that recombinant expression
of the C-terminal 105-residue portion of APP in neuronal
cells evokes cell death, and with the reports that Go
activity is linked to neuronal growth cone motility
(Strittmatter et al., BioEssays 13:127-134, 1990), axon
and dendrite formation (Granneman et al., J.
Neurochemistry 54:1995-2001, 1990), and memory (Guillen
et al., EMBO J. 9:1449-1455, 1990). This study suggests
that Alzheimer's disease is a disorder of an APP-Go
signalling system caused by structural alterations of
APP.

Example 1

The screening method of the invention can be
carried out as follows:

The assay used can be a very simple cell-free
assay employing a first polypeptide consisting
essentially of the couplone, or Go-binding portion, of
APP (SEQ ID NO: 1) and a second polypeptide consisting
essentially of an APP-binding portion of Go. This
APP-binding portion of Go may be the 15-residue segment
identified as the anticouplone portion of Go (SEQ ID
NO: 3), or it may be one or both of the two flanking
regions, residues 1-3 (SEQ ID NO: 4) and residues 19-36
(SEQ ID NO: 5) of Go. Alternatively, longer portions, or
all, of APP and/or Go can be used, or the appropriate
portions of APP and/or Go can be linked to other
polypeptides to form hybrid polypeptides with
characteristics (such as altered immunoreactivity or
enzymatic activity) that would improve detection of the
endpoint of the assay. The assay is carried out by
contacting the APP-based polypeptide with the Go-based
polypeptide in the presence of a candidate compound, in
parallel with a control assay containing no candidate
compound, and determining whether the candidate compound
inhibits co-immunoprecipitation of the first and second
polypeptides (using either an antibody specific for the
first polypeptide or an antibody specific for the second
polypeptide). Alternatively, activation of the second
(Go) polypeptide may be the measured criterion: if so,
the second polypeptide must include the GTP-binding
region of Go (SEQ ID NO: 10), and GTP or an appropriate
non-hydrolyzable analog thereof (such as GTPγS or
Gpp(NH)p) must be included in the assay. The assay may
also be carried out using phospholipid vesicles prepared
by standard methods (e.g., as described by Nishimoto et
al., J. Biol. Chem. 264:14029-14038, 1989), provided that
the first (APP) polypeptide includes a region of
hydrophobic amino acids [such as all (SEQ ID NO: 8) or a
portion (e.g., SEQ ID NO: 7) of the transmembrane region
of APP] that permit it to be anchored in the phospholipid
bilayer. Alternatively, the assay may be carried out
using intact cells or red cell ghosts which contain APP
and Go, or appropriate portions thereof. The cells may
express the first and second polypeptides naturally or by
virtue of genetic engineering, or the polypeptides may be
introduced directly into the cells or ghosts by standard
means.
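
One way the co-immunoprecipitation readout of such a screen might be scored is sketched below in Python. This is an illustration only: the band-intensity inputs, the function names, and the 50% cutoff are all hypothetical and are not specified in the text, which asks only whether the candidate compound inhibits co-immunoprecipitation relative to the parallel control.

```python
def percent_inhibition(signal_with_compound: float, signal_control: float) -> float:
    """Percent reduction in co-precipitated signal relative to the no-compound control."""
    if signal_control <= 0:
        raise ValueError("control signal must be positive")
    return 100.0 * (1.0 - signal_with_compound / signal_control)


def is_candidate_inhibitor(signal_with_compound: float,
                           signal_control: float,
                           threshold_pct: float = 50.0) -> bool:
    """Flag a compound whose inhibition meets an arbitrary, user-chosen threshold."""
    return percent_inhibition(signal_with_compound, signal_control) >= threshold_pct


if __name__ == "__main__":
    # Hypothetical densitometry values in arbitrary units.
    print(percent_inhibition(25.0, 100.0))
    print(is_candidate_inhibitor(90.0, 100.0))
```

In practice the cutoff would be chosen from the variability of replicate control assays rather than fixed in advance.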

Example 2 

The progress of Alzheimer's disease may be halted
or reversed by treating a patient with a compound which
diminishes the activation of neural Go by truncated APP.
Such a compound may be identified in a screening assay as
described above, or may consist essentially of a
polypeptide containing the amino acid sequence of (a) the
couplone region of APP (SEQ ID NO: 1), (b) the
anticouplone region of Go (SEQ ID NO: 3), or (c) the
APP-associating region(s) of Go (SEQ ID NO: 4 and/or 5),
or a combination of (b) and (c). Such polypeptides may be
produced in quantity by standard recombinant means, or by
standard synthetic techniques. To minimize proteolytic
degradation in vivo, the carboxy and amino termini may be
derivatized (e.g., with ester or amide groups), some or
all of the amino acids may be replaced with D-amino
acids, or particularly sensitive peptide linkages may be
substituted with non-peptide bonds using standard
methodology. To improve penetration of the blood-brain
barrier (BBB), the polypeptides may be altered to
increase lipophilicity (e.g., by esterification to a
bulky lipophilic moiety such as cholesteryl) or to supply
a cleavable "targetor" moiety that enhances retention on
the brain side of the barrier (Bodor et al., Science
257:1698-1700, 1992). Alternatively, the polypeptide may
be linked to an antibody to the transferrin receptor, in
order to exploit that receptor's role in transporting
iron across the blood-brain barrier, as taught by Friden
et al., Science 259:373-377, 1993. It is expected that
an intravenous dosage equivalent to approximately 1 to
100 µmoles of the polypeptide of the invention per kg per
day, or an intrathecally administered dosage of
approximately 0.1 to 50 µmoles per kg per day, will be
effective in blocking activation of Go in an Alzheimer's
patient. If the polypeptide is sufficiently protected
from proteolytic degradation, as described above, it may
also be administered orally in appropriately higher
doses. Alternatively, the compound may be incorporated
into a slow-release implant to ensure a relatively
constant supply of the therapeutic to the patient's
brain.
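
To make the molar dosage range concrete, the following Python sketch converts micromoles per kg per day into milligrams per day. The 70 kg body weight and the 2200 g/mol molecular weight are placeholder assumptions chosen only for illustration; neither value is given in the text, and the function name is hypothetical.

```python
def daily_dose_mg(dose_umol_per_kg: float,
                  body_weight_kg: float,
                  mol_weight_g_per_mol: float) -> float:
    """Convert a molar dose (umol/kg/day) into a mass dose (mg/day).

    umol/kg * kg gives umol; umol * g/mol gives micrograms; /1000 gives mg.
    """
    micrograms = dose_umol_per_kg * body_weight_kg * mol_weight_g_per_mol
    return micrograms / 1000.0


if __name__ == "__main__":
    BODY_WEIGHT_KG = 70.0   # assumed adult body weight (not from the text)
    MOL_WEIGHT = 2200.0     # assumed peptide molecular weight in g/mol (not from the text)
    for dose in (1.0, 100.0):  # intravenous range stated in the text
        print(dose, daily_dose_mg(dose, BODY_WEIGHT_KG, MOL_WEIGHT))
```

With these placeholder values the lower intravenous bound of 1 micromole per kg per day corresponds to 154 mg per day; a different peptide length or body weight scales the result linearly.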
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg
1 5 10 15

His Leu Ser Lys
20

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1910
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
TGTGGCAGGG AAGGGGCCAC C ATG GGA TGT ACG CTG AGC GCA GAG GAG AGA 51
Met Gly Cys Thr Leu Ser Ala Glu Glu Arg
1 5 10

GCC GCC CTC GAG CGG AGC AAG GCG ATT GAG AAA AAC CTA AAA GAA GAT 99
Ala Ala Leu Glu Arg Ser Lys Ala Ile Glu Lys Asn Leu Lys Glu Asp
15 20 25

GGC ATC AGC GCC GCC AAA GAC GTG AAA TTA CTG CTG GGO GCT GGA 147
Gly Ile Ser Ala Ala Lys Asp Val Lys Leu Leu Leu Gly Ala Gly
30 35 40

GAA TCA GGA AAA AGC ACC ATT GTG AAG CAG ATG AAG ATC ATC CAT GAA 195
Glu Ser Gly Lys Ser Thr Ile Val Lys Gln Met Lys Ile Ile His Glu
45 50

GAT GGC TTC TCT GGG GAA GAC GTG AAG CAG TAG AAG CCT GTG GTC TAC 243
Asp Gly Phe Ser Gly Glu Asp Val Lys Gln Tyr Lys Pro Val Val Tyr
60 65 70

AGC AAC ACC ATC CAG TCT CTG GCG GCC ATT GTC CGG GCC ATG GAC ACT 291
Ser Asn Thr Ile Gln Ser Leu Ala Ala Ile Val Arg Ala Met Asp Thr
75 80 85 90

TTC GGC GTG GAG TAT GGT GAC AAG GAG AGG AAG ACG GAC TCC AAG ATG 339
Leu Gly Val Glu Tyr Gly Asp Lys Glu Arg Lys Thr Asp Ser Lys Met
95 100 105

GTG TGT GAC GTG GTG AGT OCT ATG GAA GAC ACT GAA CCG TTC TCT GCA 387
Val Cys Asp Val Val Ser Arg Met Glu Asp Thr Glu Pro Phe Ser Ala
110 115 120

GAA CTT CTT TCT GCC ATG ATG CGA CTC TGG GGC GAC TCG GGG ATC CAG 435
Glu Leu Leu Ser Ala Met Met Arg Leu Trp Gly Asp Ser Gly Ile Gln
125 130 135

GAG TGC TTC AAC CGA TCT CGG GAG TAT CAG CTC AAT GAC TCT GCC AAA 483
Glu Cys Phe Asn Arg Ser Arg Glu Tyr Gln Leu Asn Asp Ser Ala Lys
140 145 150

TAC TAC CTG GAC AGC CTG GAT CGG ATT GGA GCC GGT GAC TAC GAG CCC 531
Tyr Tyr Leu Asp Ser Leu Asp Arg Ile Gly Ala Gly Asp Tyr Gln Pro
160 165 170

ACT GAG CAG GAC ATC CTC GGA ACC AGA GTC AAA AGA ACT GGC ATC GTA 579
Thr Glu Gln Asp Ile Leu Arg Thr Arg Val Lys Thr Thr Gly Ile Val
175 180 185

GAA ACC CAC TTC ACC TTC AAG AAC CTC CAC TTC AGG CTG TTT GAC GTC 627
Glu Thr His Phe Thr Phe Lys Asn Leu His Phe Arg Leu Phe Asp Val
190 195 200

GGG GGC CAG GGA TCT GAA CGC AAG AAG TGG ATC CAC TGC TTT GAG GAT 675
Gly Gly Gln Arg Ser Glu Arg Lys Lys Trp Ile His Cys Phe Glu Asp
205 210 215

GTC ACG GCC ATC ATC TTC TGT GTC GGA CTC AGC GGC TAT GAC GAG GTG 723
Val Thr Ala Ile Ile Phe Cys Val Ala Leu Ser Gly Tyr Asp Gln Val
220 225 230

CTC CAC GAG GAC GAA ACC ACG AAC CGC ATG CAC GAG TCT CTC ATG CTC 771
Leu His Glu Asp Glu Thr Thr Asn Arg Met His Glu Ser Leu Met Leu
235 240 245 250

TTC GAC TCC ATC TGT AAC AAC AAG TTT TTC ATT GAT ACC TGC ATC ATC 819
Phe Asp Ser Ile Cys Asn Asn Lys Phe Phe Ile Asp Thr Ser Ile Ile
255 260 265

CTC TTC CTC AAC AAG AAA GAC CTC TTT GGC GAG AAG ATT AAG AAG TGA 867
Leu Phe Leu Asn Lys Lys Asp Leu Phe Gly Glu Lys Ile Lys Lys Ser
270 275 280

CCC TTG ACC ATC TGC TTT CCC GAA TAC CCA GGC TCC AAC ACC TAT GAA 915
Pro Leu Thr Ile Cys Phe Pro Glu Tyr Pro Gly Ser Asn Thr Tyr Glu
285 290 295

GAT GGA GCT GCC TAC ATC GAA ACA GAG TTT GAA AGC AAA AAC CGC TGA 963
Asp Ala Ala Ala Tyr Ile Gln Thr Gln Phe Glu Ser Lys Asn Arg Ser
300 305 310

CCC AAC AAA GAA ATT TAC TGT CAC ATG ACT TGT GCC AGA GAG ACG AAT 1011
Pro Asn Lys Glu Ile Tyr Cys His Met Thr Cys Ala Thr Asp Thr Asn
315 320 325 330

AAT ATC CAG GTG GTA TTC GAC GCC GTC ACC GAC ATC ATC ATT GCC AAC 1059
Asn Ile Gln Val Val Phe Asp Ala Val Thr Asp Ile Ile Ile Ala Asn
335 340 345

AAT CTC GGG GGC TGC GGC TTG TAC TGACCTCTTG TCCTGTATAG GAAGCTATTT 1113
Asn Leu Arg Gly Cys Gly Leu Tyr
350



GACTGCTTCA TGGACTCTTT GCTGTTGATG TTGATCTCCT GGTAGGATGA CCTTTGGCCT 1173

TTGTAAGACA CACAGCCTTT CTGTACCAAG ACCTACGACC OGAGAGTGAC 1233

TGACGGCTGT GTATTTCTGT AGAATGCTGT AGAATACAGT TTTAGTTGAG TCTTTACATT 1293

TAGAACTTGA AAGGATTTTA AAAAACAAAA GAAAAACCAT TTCTGATGTG CTTTGTAGCT 1353

TTAAAAGAAA AAAGGAAAAC TCACCATTTA ATCCATATTT CCTTTTTATT TTGAAGTTTA 1413

AAAAAAAAAT GTCTGTACCC ACACCCTC0C CTCAGCAGAA CTGGGGCTGG 1473

CACACAGAGG GAGTGCTGGG CCTGGGGCCT CGCAGGGCTT CTGTGGAGCC GATGGCTGGT 1533

GGGAACATGT CAGGCTAGTC TGTCTAGAAG GCCACTGGCC ACTGTACCGA CCCTTCCCGA 1593

TCCCTGTGGG CTGCCCAGAC ACCTCATATA CCACCAGGCA GTGGCAQCTC CGCCCTGCTC 1653

AGCCATGCGA CTCCAAACAC ACTCAAAGTT TGCGTAGAAA AAGCACAGCT CTGGCAGGGG 1713

TAGCTGCCAC AGACAACGCT CATCACCTAT AGAAATCCAG CCCTATAGAA GCAATTCACC 1773

CAGCCCCTTC CTACACTCCC TTTGTGTTGT TAACTTTTTG GTTTTTCTGO TCCTAOTGAG 1833

TGCCTCCCAT GCATACCTGA CCAGCTCTGC GAGTGTCTGG GGTCTGGGGA ACAGGGGTTG 1893

TGTGGTXTGG TTTTTGG 1910

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Asp Ala Val Thr Asp Ile Ile Ile Ala Lys Asn Leu Arg Gly Cys
1 5 10 15

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Gly Cys
1

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Ile Glu Lys Asn Leu Lys Glu Asp Gly Ile Ser Ala Ala Lys Asp Val
1 5 10 15

Lys Leu

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 47
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Lys Lys Lys Gln Tyr Thr Ser Ile His His Gly Val Val Glu Val Asp
1 5 10 15

Ala Ala Val Thr Pro Glu Glu Arg His Leu Ser Lys Met Gln Gln Asn
20 25 30

Gly Tyr Glu Asn Pro Thr Tyr Lys Phe Phe Glu Gln Met Gln Asn
35 40 45



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Thr Val Ile Val Ile Thr Leu Val Met Leu
1 5 10

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Gly Ala Ile Ile Gly Leu Met Val Gly Gly Val Val Ile Ala Thr Val
1 5 10 15

Ile Val Ile Thr Leu Val Met Leu
20

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2085
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

ATG CTG CCC GGT TTG GCA CTG CTC CTG CTG GCT GCC TGG ACG GCT CGG 48
Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg
1 5 10 15

GCG CTG GAG GTA CCC ACT GAT GGT AAT GCT GGC CTG CTG GCT GAA CCC 96
Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro
20 25 30

CAG ATT GCC ATG TTC TGT GGC AGA CTG AAC ATG CAC ATG AAT GTC CAG 144
Gln Ile Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gln
35 40 45

AAT GGG AAG TGG GAT TCA GAT CCA TCA GGG ACC AAA ACC TGC ATT GAT 192
Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys Ile Asp
50 55 60

ACC AAG GAA GGC ATC CTG CAG TAT TGC CAA GAA GTC TAC CCT GGA CTG 240
Thr Lys Glu Gly Ile Leu Gln Tyr Cys Gln Glu Val Tyr Pro Gly Leu
65 70 75 80

CAG ATC ACC AAT GTG GTA GAA GCC AAC CAA CCA GTG ACC ATC CAG AAC 288
Gln Ile Thr Asn Val Val Glu Ala Asn Gln Pro Val Thr Ile Gln Asn
85 90 95

TGG TGC AAG CGG GGC GGC AAG CAG TGC AAG ACC CAT CCC CAC TTT GTG 336
Trp Cys Lys Arg Gly Arg Lys Gln Cys Lys Thr His Pro His Phe Val
100 105 110

ATT CCC TAC CGC TGC TTA GTT GGT GAG TTT GTA AGT GAT GCC CTT CTC 384
Ile Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu
115 120 125

GTT CCT GAC AAG TGC AAA TTC TTA CAC CAG GAG AGG ATG GAT GTT TGC 432
Val Pro Asp Lys Cys Lys Phe Leu His Gln Glu Arg Met Asp Val Cys
130 135 140

GAA ACT CAT CTT CAC TGG CAC ACC GTC GCC AAA GAG ACA TGC AGT GAG 480
Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu
145 150 155 160

AAG AGT ACC AAC TTG CAT GAC TAC GGC ATG TTG CTG CCC TGC GGA ATT 528
Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly Ile
165 170 175

GAC AAG TTC CGA GGG GTA GAG TTT GTG TGT TGC CCA CTG GCT GAA GAA 576
Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu
180 185 190

AGT GAC AAT GTG GAT TCT GCT GAT GCG GAG GAG GAT GAC TGC GAT GTC 624
Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Cys Asp Val
195 200 205

TGG TGG GGC GGA GCA GAC ACA GAC TAT GCA GAT GGG AGT GAA GAC AAA 672
Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys
210 215 220

GTA GTA GAA GTA GCA GAG GAG GAA GAA GTG GCT GAG GTG GAA GAA GAA 720
Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu
225 230 235 240

GAA GCC GAT GAT GAC GAG GAC GAT GAG GAT GGT GAT GAG GTA GAG GAA 768
Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu
245 250

GAG GCT GAG GAA CCC TAC GAA GAA GCC ACA GAG AGA ACC ACC AGC ATT 816
Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser Ile
260 265 270

GCC ACC ACC ACC ACC ACC ACC AGA GAG TCT GTG GAA GAG GTG GTT CGA 864
Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg
275 280 285

GTT CCT ACA ACA GCA GCC AGT ACC CCT GAT GCC GTT GAC AAG TAT CTC 912
Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu
290 295 300

GAG ACA CCT GGG GAT GAG AAT GAA CAT GCC CAT TTC GAG AAA GCC AAA 960
Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gln Lys Ala Lys
305 310 315 320

GAG AGG CTT GAG GCC AAG CAC CGA GAG AGA ATG TCC CAG GTC ATG AGA 1008
Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gln Val Met Arg
325 330 335

GAA TGG GAA GAG GCA GAA CGT GAA GGA AAG AAG TTG CCT AAA GCT GAT 1056
Glu Trp Glu Glu Ala Glu Arg Gln Ala Lys Asn Leu Pro Lys Ala Asp
340 345 350

AAG AAG GGA GTT ATC GAG GAT TTC CAG GAG AAA GTG GAA TCT TTG GAA 1104
Lys Lys Ala Val Ile Gln His Phe Gln Glu Lys Val Glu Ser Leu Glu
360 365

CAG GAA GCA GCC AAC GAG AGA CAG CAG CTG GTG GAG ACA CAC ATG GCC 1152
Gln Glu Ala Ala Asn Glu Arg Gln Gln Leu Val Glu Thr His Met Ala
370 375 380

AGA GTG GAA GCC ATG CTC AAT GAC CGC CGC CGC CTG GCC CTG GAG AAC 1200
Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn
390 395 400

TAC ATC ACC GCT CTG CAG GCT GTT CCT CCT CGG CCT CGT CAC GTG TTC 1248
Tyr Ile Thr Ala Leu Gln Ala Val Pro Pro Arg Pro Arg His Val Phe
405 410 415

AAT ATG CTA AAG AAG TAT GTC CGC GCA GAA CAG AAG GAC AGA CAG CAC 1296
Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gln Lys Asp Arg Gln His
420 425 430

ACC CTG AAG CAT TTC GAG CAT GTG CGC ATG GTG GAT CCC AAG AAA GCC 1344
Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala
435 440 445

GCT CAG ATC CGG TCC CAG GTT ATG ACA CAC CTC CGT GTG ATT TAT GAG 1392
Ala Gln Ile Arg Ser Gln Val Met Thr His Leu Arg Val Ile Tyr Glu
450 455 460

CGC ATG AAT CAG TCT CTC TCC CTG CTC TAC AAC GTG CCT GCA GTG GCC 1440
Arg Met Asn Gln Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala
465 470 475 480

GAG GAG ATT CAG GAT GAA GTT GAT GAG CTG CTT CAG AAA GAG GAA AAC 1488
Glu Glu Ile Gln Asp Glu Val Asp Glu Leu Leu Gln Lys Glu Gln Asn
485 490 495

TAT TCA GAT GAC GTC TTG GCC AAC ATG ATT AGT GAA CCA AGS ATC AGT 1536
Tyr Ser Asp Asp Val Leu Ala Asn Met Ile Ser Glu Pro Arg Ile Ser
500 505 510

TAC GGA AAC GAT GCT CTC ATG CCA TCT TTG ACC GAA ACG AAA ACC ACC 1584
Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr
515 520 525

GTG GAG CTC CTT CCC GTG AAT GGA GAG TTC AGC CTG GAC GAT CTC CAG 1632
Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gln
530 535 540

CCG TGG CAT TCT TTT GGG GCT GAC TCT GTG CCA GCC AAC ACA GAA AAC 1680
Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn
545 550 555 560

GAA GTT GAG CCT GTT GAT GCC CGC CCT GCT GCC GAC CGA GGA CTG ACC 1728
Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr
565 570 575

ACT CGA CCA GGT TCT GGG TTG ACA AAT ATC AAG ACG GAG GAG ATC TCT 1776
Thr Arg Pro Gly Ser Gly Leu Thr Asn Ile Lys Thr Glu Glu Ile Ser
580 585 590

GAA GTG AAG ATG GAT GGA GAA TTC CGA CAT GAC TCA GGA TAT GAA GTT 1824
Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val
600 605

CAT CAT GAA AAA TTG GTG TTC TTT GGA GAA GAT GTG GGT TCA AAC AAA 1872
His His Gln Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys
610 615 620

GGT GCA ATC ATT GGA CTC ATG GTG GGC GGT GTT GTC ATA GCG ACA GTG 1920
Gly Ala Ile Ile Gly Leu Met Val Gly Gly Val Val Ile Ala Thr Val
630 635 640

ATC GTC ATC ACC TTG GTG ATG CTG AAG AAG AAA CAG TAC ACA TCC ATT 1968
Ile Val Ile Thr Leu Val Met Leu Lys Lys Lys Gln Tyr Thr Ser Ile
645 650

CAT CAT GGT GTG GTG GAG GTT GAC GCC GCT GTC ACC CCA GAG GAG CGC 2016
His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg
660 665 670

GAC CTG TCC AAG ATG CAG CAG AAC GGC TAC GAA AAT CCA ACC TAC AAG 2064
His Leu Ser Lys Met Gln Gln Asn Gly Tyr Glu Asn Pro Thr Tyr Lys
675 680 685

TTC TTT GAG CAG ATG CAG AAC 2085
Phe Phe Glu Gln Met Gln Asn
690 695



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Lys Leu Leu Leu Leu Gly Ala Gly Glu Ser Gly Lys Ser Thr Ile Val
1 5 10 15



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu
1 5 10

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Asp Ala Glu Phe Arg His Asp Ser Gly Tyr
1 5 10



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Met Gln Gln Asn Gly Tyr Glu Asn Pro Thr Tyr Lys Phe Phe Glu Gln
1 5 10 15

Met Gln Asn

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg His
1 5 10 15

Leu Ser Lys

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg His Leu
1 5 10 15

Ser Lys

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu
1 5 10 15

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Lys Gln Tyr Thr Ser Ile His His Gly Val Val Glu Val Asp Ala Ala
1 5 10 15

Val Thr Pro Glu Glu Arg His Leu Ser Lys
20 25

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Thr Val Ile Val Ile Thr Leu Val Met Leu His His Gly Val Val Glu
1 5 10 15

Val Asp Ala Ala Val Thr Pro Glu Glu Arg His Leu Ser Lys
20 25 30

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

CCACGCAGGA TCACGGGATC CATGCTGCCC AGCTTG 36

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

GGATCC 6

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

CAGTACACAT CCATCTGATG ACATCATGGC GTGGTG 36

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

CGCCATCTCT CCAGTGATGA ATGCAGCAGA ACGGA 35

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 23:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 656
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg
1 5 10 15

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro
20 25 30

Gln Ile Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gln
35 40 45

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys Ile Asp
50 55 60

Thr Lys Glu Gly Ile Leu Gln Tyr Cys Gln Glu Val Tyr Pro Gly Leu
65 70 75 80

Gln Ile Thr Asn Val Val Glu Ala Asn Gln Pro Val Thr Ile Gln Asn
85 90 95

Trp Cys Lys Arg Gly Arg Lys Gln Cys Lys Thr His Pro His Phe Val
100 105 110

Ile Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu
115 120

Val Pro Asp Lys Cys Lys Phe Leu His Gln Glu Arg Met Asp Val Cys
130 135 140

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu
145 150 155 160

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly Ile
165 170 175

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu
180 185 190

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Cys Asp Val
195 200 205

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys
210 215 220

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu
230 235 240

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu
245 250

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser Ile
260 265 270

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg
275 280 285

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu
290 295 300

Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gln Lys Ala Lys
305 310 315 320

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gln Val Met Arg
325 330

Glu Trp Glu Glu Ala Glu Arg Gln Ala Lys Asn Leu Pro Lys Ala Asp
340 345 350

Lys Lys Ala Val Ile Gln His Phe Gln Glu Lys Val Glu Ser Leu Glu
355 360 365

Gln Glu Ala Ala Asn Glu Arg Gln Gln Leu Val Glu Thr His Met Ala
370 375 380

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn
390 395 400

Tyr Ile Thr Ala Leu Gln Ala Val Pro Pro Arg Pro Arg His Val Phe
405 410 415

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gln Lys Asp Arg Gln His
420 425 430

Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala
435 440 445

Ala Gln Ile Arg Ser Gln Val Met Thr His Leu Arg Val Ile Tyr Glu
450 455 460

Arg Met Asn Gln Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala
465 470 475 480

Glu Glu Ile Gln Asp Glu Val Asp Glu Leu Leu Gln Lys Glu Gln Asn
485 490 495

Tyr Ser Asp Asp Val Leu Ala Asn Met Ile Ser Glu Pro Arg Ile Ser
500 505 510

Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr
515 520 525

Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gln
530 535 540

Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn
545 550 555 560 

Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Oly Leu Thr 

565 570 



Thr Arg Pro Gly Ser Gly Leu Thr Aan lie Lya Thr Glu Glu lie Ser 

580 585 590 

Glu Val Lye Met Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val 
595 600 605 

His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys 
610 615 620 

Oly Ala He He Gly Leu Ket Val Gly Gly Val Val He Ala Thr val 

630 635 640 



He Val He Thr Leu Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He 

645 650 655 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 24:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 676

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg
1 5 10 15

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro
20 25 30

Gln Ile Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gln
35 40 45

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys Ile Asp
50 55 60

Thr Lys Glu Gly Ile Leu Gln Tyr Cys Gln Glu Val Tyr Pro Gly Leu
65 70 75 80

Gln Ile Thr Asn Val Val Glu Ala Asn Gln Pro Val Thr Ile Gln Asn
85 90 95

Trp Cys Lys Arg Gly Arg Lys Gln Cys Lys Thr His Pro His Phe Val
100 105 110

Ile Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu
115 120 125

Val Pro Asp Lys Cys Lys Phe Leu His Gln Glu Arg Met Asp Val Cys
130 135 140

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu
145 150 155 160

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly Ile
165 170 175

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu
180 185 190

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Cys Asp Val
195 200 205

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys
210 215 220

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu
225 230 235 240

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu
245 250 255

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser Ile
260 265 270

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg
275 280 285

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu
290 295 300

Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gln Lys Ala Lys
305 310 315 320

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gln Val Met Arg
325 330 335

Glu Trp Glu Glu Ala Glu Arg Gln Ala Lys Asn Leu Pro Lys Ala Asp
340 345 350

Lys Lys Ala Val Ile Gln His Phe Gln Glu Lys Val Glu Ser Leu Glu
355 360 365

Gln Glu Ala Ala Asn Glu Arg Gln Gln Leu Val Glu Thr His Met Ala
370 375 380

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn
385 390 395 400

Tyr Ile Thr Ala Leu Gln Ala Val Pro Pro Arg Pro Arg His Val Phe
405 410 415

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gln Lys Asp Arg Gln His
420 425 430

Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala
435 440 445

Ala Gln Ile Arg Ser Gln Val Met Thr His Leu Arg Val Ile Tyr Glu
450 455 460

Arg Met Asn Gln Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala
465 470 475 480

Glu Glu Ile Gln Asp Glu Val Asp Glu Leu Leu Gln Lys Glu Gln Asn
485 490 495

Tyr Ser Asp Asp Val Leu Ala Asn Met Ile Ser Glu Pro Arg Ile Ser
500 505 510

Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr
515 520 525

Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gln
530 535 540

Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn
545 550 555 560

Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr
565 570 575

Thr Arg Pro Gly Ser Gly Leu Thr Asn Ile Lys Thr Glu Glu Ile Ser
580 585 590

Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val
595 600 605

His His Gln Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys
610 615 620

Gly Ala Ile Ile Gly Leu Met Val Gly Gly Val Val Ile Ala Thr Val
625 630 635 640

Ile Val Ile Thr Leu Val Met Leu Lys Lys Lys Gln Tyr Thr Ser Ile
645 650 655

His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg
660 665 670

His Leu Ser Lys
675

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 25:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 58

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Ala Asp Ser Val Pro Ala Asn Thr Glu Asn Glu Val Glu Pro Val Asp
1 5 10 15

Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr Thr Arg Pro Gly Ser Gly
20 25 30

Leu Thr Asn Ile Lys Thr Glu Glu Ile Ser Glu Val Lys Met Asp Ala
35 40 45

Glu Phe Arg His Asp Ser Gly Tyr Glu Val
50 55

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 26:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 56

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

Val Ile Val Ile Thr Leu Val Met Leu Lys Lys Lys Gln Tyr Thr Ser
1 5 10 15

Ile His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu
20 25 30

Arg His Leu Ser Lys Met Gln Gln Asn Gly Tyr Glu Asn Pro Thr Tyr
35 40 45

Lys Phe Phe Glu Gln Met Gln Asn
50 55
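The amino acid entries in this listing use three-letter residue codes with a position count printed under every fifth residue. As a hedged illustration only, the sketch below converts the three-letter listing of SEQ ID NO: 25 to one-letter code and compares the residue count with the declared LENGTH of 58. The helper name `to_one_letter` is hypothetical; the mapping is the standard three-letter to one-letter amino acid code.

```python
# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(listing: str) -> str:
    """Convert whitespace-separated three-letter codes to a one-letter string."""
    residues = listing.split()
    # Reject anything that is not a recognized code (for example a misread token).
    unknown = [r for r in residues if r not in THREE_TO_ONE]
    if unknown:
        raise ValueError(f"unrecognized residue codes: {unknown}")
    return "".join(THREE_TO_ONE[r] for r in residues)


# Residues copied from SEQ ID NO: 25 (position-number rows omitted).
SEQ_ID_25 = """
Ala Asp Ser Val Pro Ala Asn Thr Glu Asn Glu Val Glu Pro Val Asp
Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr Thr Arg Pro Gly Ser Gly
Leu Thr Asn Ile Lys Thr Glu Glu Ile Ser Glu Val Lys Met Asp Ala
Glu Phe Arg His Asp Ser Gly Tyr Glu Val
"""

seq25 = to_one_letter(SEQ_ID_25)
print(len(seq25), seq25)
```

The same helper can be applied to the other amino acid entries to compare each residue count with its declared LENGTH.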



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 27:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 695

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg
1 5 10 15

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro
20 25 30

Gln Ile Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gln
35 40 45

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys Ile Asp
50 55 60

Thr Lys Glu Gly Ile Leu Gln Tyr Cys Gln Glu Val Tyr Pro Gly Leu
65 70 75 80

Gln Ile Thr Asn Val Val Glu Ala Asn Gln Pro Val Thr Ile Gln Asn
85 90 95

Trp Cys Lys Arg Gly Arg Lys Gln Cys Lys Thr His Pro His Phe Val
100 105 110

Ile Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu
115 120 125

Val Pro Asp Lys Cys Lys Phe Leu His Gln Glu Arg Met Asp Val Cys
130 135 140

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu
145 150 155 160

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly Ile
165 170 175

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu
180 185 190

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Cys Asp Val
195 200 205

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys
210 215 220

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu
225 230 235 240

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu
245 250 255

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser Ile
260 265 270

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg
275 280 285

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu
290 295 300

Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gln Lys Ala Lys
305 310 315 320

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gln Val Met Arg
325 330 335

Glu Trp Glu Glu Ala Glu Arg Gln Ala Lys Asn Leu Pro Lys Ala Asp
340 345 350

Lys Lys Ala Val Ile Gln His Phe Gln Glu Lys Val Glu Ser Leu Glu
355 360 365

Gln Glu Ala Ala Asn Glu Arg Gln Gln Leu Val Glu Thr His Met Ala
370 375 380

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn
385 390 395 400

Tyr Ile Thr Ala Leu Gln Ala Val Pro Pro Arg Pro Arg His Val Phe
405 410 415

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gln Lys Asp Arg Gln His
420 425 430

Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala
435 440 445

Ala Gln Ile Arg Ser Gln Val Met Thr His Leu Arg Val Ile Tyr Glu
450 455 460

Arg Met Asn Gln Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala
465 470 475 480

Glu Glu Ile Gln Asp Glu Val Asp Glu Leu Leu Gln Lys Glu Gln Asn
485 490 495

Tyr Ser Asp Asp Val Leu Ala Asn Met Ile Ser Glu Pro Arg Ile Ser
500 505 510

Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr
515 520 525

Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gln
530 535 540

Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn
545 550 555 560

Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr
565 570 575

Thr Arg Pro Gly Ser Gly Leu Thr Asn Ile Lys Thr Glu Glu Ile Ser
580 585 590

Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val
595 600 605

His His Gln Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys
610 615 620

Gly Ala Ile Ile Gly Leu Met Val Gly Gly Val Val Ile Ala Thr Val
625 630 635 640

Ile Val Ile Thr Leu Val Met Leu Lys Lys Lys Gln Tyr Thr Ser Ile
645 650 655

His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg
660 665 670

His Leu Ser Lys Met Gln Gln Asn Gly Tyr Glu Asn Pro Thr Tyr Lys
675 680 685

Phe Phe Glu Gln Met Gln Asn
690 695

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2274

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

GCTGTGGCAG GGAAGGGGCC ACC ATG GGA TGT ACG CTG AGC GCA GAG GAG 50
Met Gly Cys Thr Leu Ser Ala Glu Glu
1 5

AGA GCC GCC CTC GAG CGG AGC AAG GCG ATT GAG AAA AAC CTC AAA GAA 98
Arg Ala Ala Leu Glu Arg Ser Lys Ala Ile Gln Lys Asn Leu Lys Glu
10 15 20 25

GAT GGC ATC AGC GCC GCC AAA GAC GTG AAA TTA CTC CTG CTO GGG GCT 146
Asp Gly Ile Ser Ala Ala Lys Asp Val Lys Leu Leu Leu Leu Gly Ala
30 35 40

GGA GAA TCA GGA AAA AGC ACC ATT GTG AAG GAG ATG AAG ATC ATC CAT 194
Gly Glu Ser Gly Lys Ser Thr Ile Val Lys Gln Met Lys Ile Ile His
45 50 55

GAA GAT GGC TTC TCT GGG GAA GAG GTG AAG CAG TAC AAG CCT GTG GTC 242
Glu Asp Gly Phe Ser Gly Glu Asp Val Lys Gln Tyr Lys Pro Val Val
60 65 70

TAC AGC AAC ACC ATC CAG TCT CTG GCG GCC ATT GTC CGG GCC ATG GAC 290
Tyr Ser Asn Thr Ile Gln Ser Leu Ala Ala Ile Val Arg Ala Met Asp
75 80 85

ACT TTG GGC GTG GAG TAT GGT GAC AAG GAG AGG AAG AOS GAC TCC AAG 338
Thr Leu Gly Val Glu Tyr Gly Asp Lys Glu Arg Lys Thr Asp Ser Lys
90 95 100 105

ATG GTG TGT GAC GTG GTG AGT CGT ATG GAA GAC ACT GAA CCG TTC TCT 386
Met Val Cys Asp Val Val Ser Arg Met Glu Asp Thr Glu Pro Phe Ser
110 115 120

GGA GAA CTT CTT TCT GCC ATG ATG CGA CTC TGG GGC GAC TCG GGG ATC 434
Ala Glu Leu Leu Ser Ala Met Met Arg Leu Trp Gly Asp Ser Gly Ile
125 130 135

CAG GAG TGC TTC AAC CGA TCT CGG GAG TAT CAG CTC AAT GAC TCT GCC 482
Gln Glu Cys Phe Asn Arg Ser Arg Glu Tyr Gln Leu Asn Asp Ser Ala
140 145 150

AAA TAC TAC CTG GAC AGC CTG GAT CGG ATT GGA GCC GGT GAC TAC CAG 530
Lys Tyr Tyr Leu Asp Ser Leu Asp Arg Ile Gly Ala Gly Asp Tyr Gln
155 160 165

CCC ACT GAG CAG GAC ATC CTC CGA ACC AGA GTC AAA AGA ACT GGC ATC 578
Pro Thr Glu Gln Asp Ile Leu Arg Thr Arg Val Lys Thr Thr Gly Ile
170 175 180 185

GTA GAA ACC GAC TTC ACC TTC AAG AAC CTC CAC TTC AGG CTG TTT GAC 626
Val Glu Thr His Phe Thr Phe Lys Asn Leu His Phe Arg Leu Phe Asp
190 195 200

GTC GGG GGC CAG CGA TCT GAA CGC AAG AAG TGG ATC CAC TCC TTT GAG 674
Val Gly Gly Gln Arg Ser Glu Arg Lys Lys Trp Ile His Cys Phe Glu
205 210 215

GAT GTC AGO GCC ATC ATC TTC TGT GTC GGA CTC AGC GGC TAT GAC CAG 722
Asp Val Thr Ala Ile Ile Phe Cys Val Ala Leu Ser Gly Tyr Asp Gln
220 225 230

GTG CTC CAC GAG GAC GAA ACC AGG AAC CGC ATG CAC GAA TCC CTG AAG 770
Val Leu His Glu Asp Glu Thr Thr Asn Arg Met His Glu Ser Leu Lys
235 240 245

CTC TTC GAC AGC ATC TGC AAC AAC AAG TGG TTC AGA GAC AGA TCT ATT 818
Leu Phe Asp Ser Ile Cys Asn Asn Lys Trp Phe Thr Asp Thr Ser Ile
250 255 260 265

ATC CTG TTT CTC AAC AAG AAG GAC ATA TTT GAG GAG AAG ATC AAG AAG 866
Ile Leu Phe Leu Asn Lys Lys Asp Ile Phe Glu Glu Lys Ile Lys Lys
270 275 280

TCC CCA CTC ACC ATC TGC TTT CCT GAA TAC ACA CCC CCC ACT GCC TTC 914
Ser Pro Leu Thr Ile Cys Phe Pro Glu Tyr Thr Gly Pro Ser Ala Phe
285 290 295

ACA GAA GCT GTG GCT CAC ATC GAA GGG CAC TAT GAG AGT AAG AAT AAG 962
Thr Glu Ala Val Ala His Ile Gln Gly Gln Tyr Glu Ser Lys Asn Lys
300 305 310

TCA GCT CAC AAG GAA GTC TAC AGC CAT GTC ACC TGT GCC ACG GAC ACC 1010
Ser Ala His Lys Glu Val Tyr Ser His Val Thr Cys Ala Thr Asp Thr
315 320 325

AAC AAC ATC GAA TTC GTC TTT GAT GCC GTG ACA GAT GTC ATC ATC GCC 1058
Asn Asn Ile Gln Phe Val Phe Asp Ala Val Thr Asp Val Ile Ile Ala
330 335 340 345

AAA AAC CTA CGG GGC TGT GGA CTC TAC TGAGCCCTGG CCTCCTACCC 1105
Lys Asn Leu Arg Gly Cys Gly Leu Tyr
350



AGCCTGCCAC TCACTCCTCC CCTGGACCCA GAGCTCTGTC ACTGCTCAGA TGCCCTGTTA 1165

ACTGAAGAAA ACCTGGAGGC TAGCCTTGGG GGCAGGAGGA GGGATOCTTT GAGCATCCCC 1225

ACCCCACCCA ACTTCAGCCT CGTGACACGT GGGAACAGGG TTGGGCAGAG GTGTGGAACA 1285

GCACAAGGCC [illegible] TCACTGGTCA GCTGTGTGTC 1345

TTACAGAGAG GCCGAGTGGG CAACACTGCC ATCTGATTCA GAATGGGCAT GCCCTGTCCT 1405

CTGTACCTCT TGTTCAGTGT CCTGGTTTCT CTTCCACCTT GGTGATAGGA TGGCTGGCAG 1465

GAAGGCCCCA TGGAAGGTGC TGCTTGATTA GGGGATAGTC GATGGGATCT CTCAGGAGTC 1525

CTCAGGGTCT GTTTGGTAGA GGGTGGTTTC GTOGACAAAA GCCAACATGG AATCAGGCCA 1585

CTTTTGGGGC GCAAAGACTC AGACTTTGGG GAOGGGTTCC CTCCTCCTTC ACTTTGGATC 1645

TTGGCCCCTC TCTGGTCATC TTCOCTTGCC CTTGGGCTCC CCAGGATACT CAGCCCTGAC 1705

TCCCATGGGG TTGGGAATAT TCCTTAAGAC TGGCTGACTG CAAAGGTGAC CGATGGAGAA 1765

ACATCCCTGT GCTACAGAAT TGGGGGTGGG AGAGCTGAGG GGGCAGGCGG CTCTTTCCTG 1825

ATAGTTGATG ACAAGCCCTG AGAATGCCAT CTGCTGGCTC CACTCAGACG GGCTCAACTG 1885

TCCTGGGTGA TAGTGACTTG CCAGGCCACA GGCTGCAGGT GAGAGACAGA GCAGGCAAGC 1945

AGCCTTGCAA CTGCAGATTA CTTAGGGAGA AGCATCCTAG CCCCAGCTAA CTTTGGACAG 2005

TCAGCATATG TCCCTGCCAT CCCTAGACAT CTCCAGTCAG CTGGTATCAC AGCCAGTGGT 2065

TGAGAGAGGT TTGAATGCTC ATGTGGGAGG GGGCCCGGTA CCCAGCTTTT GTTCCCTTTA 2125

GTGAGGGTTA ATTGCGCGCT TGGGCTAATC ATGGTCATAG CTGTTGGGCG TTGCTGGCGT 2185

TTTTCCATAG GCTCCGCCCC CTGACGAGAT CACAAAAATC GAOGCTGAAG TCAGAGGTGG 2245

CGAAACCGAC AGACTATAAG ATACCAGGC 2274

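In SEQ ID NO: 28 each row of codons is printed above its three-letter translation. The sketch below is an illustration rather than part of the listing: it translates one codon row with the standard genetic code and reports positions where a printed translation differs. The function names `translate_row` and `mismatches` are hypothetical, and the standard genetic code is assumed to apply.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

ONE_TO_THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "End",
}


def translate_row(codon_row: str) -> list[str]:
    """Translate space-separated codons into three-letter residue codes."""
    return [ONE_TO_THREE[CODON_TABLE[codon]] for codon in codon_row.split()]


def mismatches(codon_row: str, printed_row: str) -> list[int]:
    """Return 1-based positions where the printed translation differs."""
    translated = translate_row(codon_row)
    printed = printed_row.split()
    return [i + 1 for i, (t, p) in enumerate(zip(translated, printed)) if t != p]


# Codon row ending at nucleotide 290 of SEQ ID NO: 28 and its printed translation.
row = "TAC AGC AAC ACC ATC CAG TCT CTG GCG GCC ATT GTC CGG GCC ATG GAC"
printed = "Tyr Ser Asn Thr Ile Gln Ser Leu Ala Ala Ile Val Arg Ala Met Asp"
print(translate_row(row))
print(mismatches(row, printed))
```

Rows for which `mismatches` returns a non-empty list are candidates for a transcription error in either the codon line or the translation line; the sketch does not decide which of the two is wrong.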
(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 29:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

Asp Val Gly Gly Gln Arg Ser Glu Arg Lys Lys Trp Ile His Cys Phe
1 5 10 15 

Glu Asp 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 30: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



Thr Ser Ile Ile Leu Phe Leu Asn Lys Lys Asp Leu
1 5 10

1. A method of identifying a therapeutic useful
for treating or preventing the symptoms of Alzheimer's
disease, which method includes the steps of

contacting (a) a first molecule comprising the
couplone portion (SEQ ID NO: 1) of amyloid precursor
protein (APP) with (b) a second molecule comprising an
APP-associating region of Go (SEQ ID NOs: 3, 4, or 5), in
the presence of a candidate compound; and

determining whether said candidate compound
interferes with the association of said first and second
molecules, said interference being an indication that
said candidate compound is a therapeutic useful for
treating Alzheimer's disease.

2. The method of claim 1, wherein said
determining step is accomplished by

immunoprecipitating said first molecule with an
antibody specific for APP; and

detecting the presence or amount of said second
molecule which co-precipitates with said first molecule.

3. The method of claim 1, wherein said
determining step is accomplished by

immunoprecipitating said second molecule with an
antibody specific for Go; and

detecting the presence or amount of said first
molecule which co-precipitates with said second molecule.



4. The method of claim 1, wherein said first
molecule comprises the portion of APP695 from residues 649
to 695 (SEQ ID NO: 6).

5. The method of claim 1, wherein said first
molecule comprises the portion of APP695 from residues 639
to 648 (SEQ ID NO: 7).

6. The method of claim 1, wherein said first
molecule comprises the portion of APP695 from residues 640
to 695 (SEQ ID NO: 26).

7. The method of claim 6, wherein said first
molecule comprises essentially all of APP695 (SEQ ID
NO: 27).
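Claims 4 to 6 identify portions of APP695 by inclusive, 1-based residue ranges. The following sketch, offered only as an illustration, shows how such ranges map onto string slices. It uses the 56-residue fragment listed as SEQ ID NO: 26 (residues 640 to 695) written in one-letter code; the names `TAIL_640_695`, `range_length` and `residues` are hypothetical.

```python
# SEQ ID NO: 26 (APP695 residues 640-695) in one-letter code.
TAIL_640_695 = "VIVITLVMLKKKQYTSIHHGVVEVDAAVTPEERHLSKMQQNGYENPTYKFFEQMQN"
TAIL_START = 640  # APP695 residue number of the first character above


def range_length(first: int, last: int) -> int:
    """Number of residues in an inclusive, 1-based residue range."""
    return last - first + 1


def residues(first: int, last: int) -> str:
    """Slice the listed fragment by APP695 residue numbers (inclusive)."""
    tail_end = TAIL_START + len(TAIL_640_695) - 1
    if not (TAIL_START <= first <= last <= tail_end):
        raise ValueError("range lies outside the listed fragment")
    # Convert 1-based inclusive residue numbers to a 0-based half-open slice.
    return TAIL_640_695[first - TAIL_START : last - TAIL_START + 1]


print(range_length(649, 695))  # range recited in claim 4
print(range_length(639, 648))  # range recited in claim 5
print(range_length(640, 695))  # range recited in claim 6
print(residues(649, 695))      # the claim 4 range, sliced from SEQ ID NO: 26
```

The range recited in claim 5 begins at residue 639, one residue before the listed fragment, so `residues` rejects it; slicing that range would require the full-length sequence of SEQ ID NO: 27.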

8. The method of claim 1, wherein said second
molecule comprises the GTP-binding region of Go (SEQ ID
NO: 10).

9. The method of claim 8, wherein said second
molecule comprises essentially all of Go (SEQ ID NO: 2).

10. A method of assaying for a therapeutic useful
for treating Alzheimer's disease, which method includes
the steps of

contacting (a) a first molecule comprising the
couplone region of APP (SEQ ID NO: 1) with (b) a second
molecule comprising an APP-associating region of Go (SEQ
ID NO: 3, 4, or 5), in the presence of a candidate
compound; and

determining whether said candidate compound
interferes with the activation of said second molecule by
said first molecule, said interference being an
indication that said candidate compound is a therapeutic
useful for treating Alzheimer's disease.

11. The method of claim 10, wherein said
determining step is accomplished by

contacting said second molecule with a substrate
comprising GTP or an analog of GTP; and

detecting or measuring the binding of said
substrate to said second molecule, wherein said binding
is evidence of said activation of said second molecule by
said first molecule.
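Claims 10 and 11 turn on whether a candidate compound reduces the binding of GTP or a GTP analog to the second molecule. One common way to express such a reduction is percent inhibition relative to a no-compound control. The sketch below is a generic illustration under that assumption, not the scoring used in the specification; the function name, the background correction and the example readings are all hypothetical.

```python
def percent_inhibition(with_compound: float,
                       without_compound: float,
                       background: float = 0.0) -> float:
    """Percent reduction of a binding signal caused by a candidate compound.

    with_compound:    signal measured in the presence of the candidate
    without_compound: signal measured in its absence (control)
    background:       signal with no activation (hypothetical blank)
    """
    window = without_compound - background
    if window <= 0:
        raise ValueError("control signal must exceed background")
    return 100.0 * (1.0 - (with_compound - background) / window)


# Hypothetical readings in arbitrary units, for illustration only.
print(percent_inhibition(with_compound=600.0,
                         without_compound=1000.0,
                         background=200.0))  # 50.0
```

Any cut-off above which a compound would be counted as interfering is a design choice that this sketch leaves open.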

12. The method of claim 1, wherein said
contacting step is carried out at a Mg2+ concentration
between 1 x 10^-7 and 1 x 10^-2 M.

13. The method of claim 10, wherein said
contacting step is carried out at a Mg2+ concentration
between 1 x 10^-7 and 1 x 10^-2 M.
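Claims 12 and 13 recite a Mg2+ concentration window spanning five orders of magnitude. As a small illustration, the sketch below checks whether a planned concentration, given in mol/L, lies in that window. Treating the end points as inclusive is an assumption of this sketch, and the names used are hypothetical.

```python
MG_MIN_M = 1e-7  # lower bound recited in claims 12 and 13, in mol/L
MG_MAX_M = 1e-2  # upper bound recited in claims 12 and 13, in mol/L


def mg_in_claimed_range(conc_molar: float) -> bool:
    """True if a Mg2+ concentration (mol/L) lies within the recited window.

    Assumption of this sketch: both end points are treated as inclusive.
    """
    return MG_MIN_M <= conc_molar <= MG_MAX_M


# A hypothetical ten-fold dilution series, one point per decade of the window.
series = [1e-7, 1e-6, 1e-5, 1e-4, 1e-3, 1e-2]
print([mg_in_claimed_range(c) for c in series])
print(mg_in_claimed_range(5e-2))  # 50 mM lies above the window
```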

14. The method of claim 1, wherein said
contacting step is carried out in a cell-free system.

15. The method of claim 10, wherein said
contacting step is carried out in a cell-free system.

16. A system for screening candidate Alzheimer's
disease therapeutics, which system comprises

a first polypeptide comprising a sequence
essentially identical to that of peptide 20 (SEQ ID
NO: 1);

a second polypeptide comprising a sequence
essentially identical to the anticouplone sequence of Go
(SEQ ID NO: 3); and

a means for detecting either (a) the association
of said first polypeptide with said second polypeptide,
or (b) the activation of said second polypeptide by said
first polypeptide.

17. A cell-free system for screening candidate
Alzheimer's disease therapeutics, which system comprises

a first polypeptide comprising a sequence
essentially identical to that of peptide 20 (SEQ ID
NO: 1); and

a second polypeptide comprising a sequence
essentially identical to the anticouplone sequence of Go
(SEQ ID NO: 3).

18. The system of claim 17, wherein said first
polypeptide is anchored to a solid material or is in a
phospholipid vesicle.

19. The system of claim 17, wherein said second
polypeptide further comprises residues 1 to 3 (SEQ ID
NO: 4) and 19 to 36 (SEQ ID NO: 5) of Go.

20. The system of claim 19, wherein said second
polypeptide comprises Go1 or Go2.

21. A method for diminishing the activation of Go
in a neuronal cell by treating the cell with a compound
which blocks association of Go with the cytoplasmic tail
of APP.

22. The method of claim 21, wherein the compound
is a peptide fragment of Go or of the cytoplasmic tail of
APP.

23. The method of claim 21, wherein said cell is
within an animal.

24. The method of claim 23, wherein said animal
is a human.

25. A method for preventing or treating
Alzheimer's disease in a patient, comprising treating the
patient with a compound which blocks association of Go
with the cytoplasmic tail of APP.

26. A method for preventing or treating
Alzheimer's disease in a patient, comprising treating the
patient with a compound which inhibits activation of
neuronal Go by the cytoplasmic tail of APP.

27. A peptide having less than 50 amino acids and
comprising the sequence of peptide 20 (SEQ ID NO: 1).

28. A therapeutic composition comprising the
peptide of claim 27 and a pharmaceutically acceptable
carrier.

29. A method for identifying a ligand for which
APP is a receptor, which method includes the steps of

providing an APP molecule and a Go molecule;

contacting a candidate compound with the
extracellular domain of said APP molecule, the
cytoplasmic tail of said APP molecule being accessible to
said Go molecule; and

detecting either (a) association of said Go
molecule with said APP molecule, or (b) activation of
said Go molecule by said APP molecule, said association
or activation being evidence that said candidate compound
is a ligand of APP.